Abstract
The socio-ecological benefits that urban trees provide depend on their health and performance. Excess heat (i.e., Urban Heat Island; UHI) and other urban conditions affect tree physiology with outcomes from enhanced growth to mortality. Resilient urban forests in the face of climate change require species-specific understanding of growth responses. However, to date, studies assessing growth dynamics were primarily based on remote sensing of communities rather than individuals, or relied on labor-intensive methods that can limit the spatial coverage necessary to account for highly variable urban growing conditions. Here, we analyze growth dynamics of common urban tree species over time and across space for Berlin (Germany), a large metropolis, combining dendroecological (temporal) and inventory assessments (spatial). First, we show that annual increments increased across the 20th century for early (i.e., young) growth. Second, we use an approach relying on open inventory data to identify growth potential (diameter) in relation to excess heat across Berlin while accounting for age, potential management effects, and the urban fabric (e.g., planting area; building density, height; available soil nutrients) by applying generalized additive models for the ten most abundant species. Our analyses showed that younger trees may benefit from increased temperatures, while older individuals feature lower growth at greater UHI magnitudes. Furthermore, we show that planting area as well as building density modulate growth responses to temperature. Lastly, we discuss implications for the management of urban trees in the context of climate change mitigation, considering that younger trees are predominantly located at UHI “hot spots” and will undergo the observed age-dependent shift in temperature-growth sensitivity. By relying on increasingly available open data (inventories, urban/environmental data) our approach here is or will be readily transferable to other urban regions, allowing for species-specific growth assessments, while accounting for urban conditions at an individual level.
The manifold ecological and societal benefits urban trees provide (e.g., Roy et al., 2012) depend critically on their health and performance. For instance, trees alter local energy budgets (Grimmond et al., 1996 ; Hertel and Schlink, 2019) through shading and transpiration (Endlicher et al., 2016; Gillner et al., 2015), and therefore can reduce ambient temperatures, infrastructure power-consumption and (human) thermal discomfort (Akbari et al., 2001; e.g. Gulyás et al., 2006; Hoyano, 1988; Mayer and Höppe, 1987). However, excess heat common for cities (i.e., Urban Heat Island, UHI, Oke, 1982), combined with other urban conditions, affects tree physiological functioning with outcomes ranging from enhanced growth to early senescence, branch die-back, and even mortality (Au, 2018; Gillner et al., 2014; e.g., Hilbert et al., 2019). Thus, assessing the effect of increased temperatures on trees, as part of urban green infrastructure, is instrumental for understanding as well as adapting to current and expected conditions in this century (Ward and Johnson, 2007), especially considering ever more urbanized societies and the potential for UHI effects to compound with more frequent atmospheric drought (Brune, 2016; Norton et al., 2015; Roloff et al., 2009).
The UHI effect, i.e., the difference between urban and adjacent rural (air) temperatures, has been intensively studied for several decades (cf. Oke, 1982; Stewart, 2011). It is typically related to the structure and density of urban land-use (Kuttler et al., 2015), which can be characterized through local climate zones, and modulated by physiographic and urban characteristics, such as vicinity to water bodies, predominant wind and street direction, etc. (Stewart and Oke, 2012); yet, the physical basis for the excess heat in cities is to a large extent found in the altered surface energy balance as the proportional cover of vegetation decreases compared to rural (or reference) systems (Hertel and Schlink, 2019; Oke, 1992). In temperate climates, this results in strongest UHI magnitudes at night (cf. Fenner et al., 2014). For example, Berlin features the most intense UHI in Germany due to its large extent and development intensity with an average air temperature increase of around 5 K at night-times (2001-2010) with maxima of up to 11 K (Fenner et al., 2014) in urban \(vs.\) rural areas.
Increased air temperatures due to UHIs can affect tree growth through altering several physiological processes across plant organs directly or indirectly (Dusenge et al., 2019). Generally, reaction times at cellular level increase with temperature up to a maximum, after which a drop in enzymatic activity results in a species-dependent optimum curve (Arcus et al., 2016; Parent et al., 2010). In leaves this optimum response is reflected in the net assimilation rate of carbohydrates, as a balance of photosynthesis and respiration, with losses exceeding gains more rapidly with increasing temperatures (Long, 1991). These responses vary between species (Tjoelker et al., 2001) as well as intra-specifically due to local acclimation, i.e., a shift of optimum temperature responses after prolonged exposure (Yamori et al., 2014), and threshold temperatures before tissue damage occurs (for review see Geange et al., 2021). High temperatures in temperate areas are often coincident with low relative air humidity (i.e., large vapor pressure deficit), which in turn can decrease stomatal conductance governing the majority of gas exchange in leaves (Grossiord et al., 2020), and thus the capacity for photosynthesis. Under prolonged stomatal closure (or decreased conductance) with high temperatures, trees may thus face decreased growth (in subsequent years) or even starvation as their carbohydrate reserves are depleted yet not replenished at sufficient rates (McDowell et al., 2008). Furthermore, air (and soil temperatures) affect the initiation, speed and cessation of cambial activity, and thus radial growth throughout a growing season (e.g., see Begum et al., 2013; Rathgeber et al., 2016). Radial growth is increasingly considered to be limited by wood formation dynamics and their relation with environmental drivers, rather than solely by photosynthetic activity (Körner, 2015). In particular, the availability of soil water is critical for cell expansion (e.g., Peters et al., 2021) and most likely limits radial growth before photosynthesis (Fatichi et al., 2014); however, this water availability is again linked to local climate as higher temperatures drive evaporation and thus may contribute to the depletion of soil water storage, impeding growth.
Several urban tree species have been shown to have tendencies of enhanced growth rates and/or productivity compared to rural conspecifics (e.g., Briber et al., 2015; O’Brien et al., 2012), which is typically attributed to increased temperatures (Jia et al., 2018; Pretzsch et al., 2017), however, a broad range of effect sizes and signs (i.e., reduced growth) specific to species and location are also commonly reported alongside. For instance, Zhao et al. (2016) showed that productivity rates, as a proxy for growth, increased within urban clusters as urbanization intensifies using remotely sensed vegetation indices. Furthermore, Moser-Reischl et al. (2019) identified positive associations between air temperature and radial growth for two species commonly selected by urban planners (Tilia cordata MilL., Rubinia pseudoacacia) in Munich. Similarly, for Berlin, Dahlhausen et al. (2018), identified enhanced growth in highly urbanized environments for T. cordata, the most abundant tree of the city, which they attributed to the UHI effect, while intermediate development intensity was adverse for tree growth. By contrast, Gillner et al. (2014) highlight decreased growth for Acer species (A. platanoides and pseudoplatanus), Platanus x hispanica and Quercus rubra with higher summer temperatures of the preceding year, especially when compounded with drought, in another major German city (Dresden). These differences in growth trends may result from contrasting species-specific responses to increased temperatures, but are indeed affected by other (time-varying) factors and stochastic processes, such as water availability, pollution and road-salt loading, structural impedance by infrastructure, or management, etc. (Pauleit et al., 2002; Quigley, 2004; Randrup et al., 2001; Rhoades and Stipes, 1999).
Several of the aforementioned studies applied space-for-time substitutions or time series comparisons to quantify temperature and excess heat on growth for a given region. However, they typically compared trees grouped using qualitative or summary descriptors of sampling sites, disregarding the spatial and/or temporal variability in location-specific factors noted above. This can hinder the extrapolation from individual sampling sites toward predicting effects across entire urban areas and tree stocks, especially when studies rely on labor-intensive methods, reducing sample sizes and coverage of species and space. A lack of co-located environmental variables (i.e. measured in situ) at pertinent spatial scales can further exacerbate these limitations, for instance, as noted by Wohlfahrt et al. (2019) for air temperature and tree leaf phenology, which may lead to incorrect inferences and interpretations for the role of climate change on growth/productivity. It is thus likely that the varying and even contrasting growth responses observed for urban trees across and within studies are at least modulated by some confounding factors, making the attribution to a single driver, such as excess heat, more difficult and possibly less accurate.
These limitations could be overcome by developing extensive dendroecological surveys (i.e., incremental growth) and/or inventories (single or repeat) combined with pertinent environmental data with adequate spatio-temporal coverage and resolution. Inventories are logistically and financially more feasible, and - together with environmental data - are increasingly more available (e.g. Ossola et al., 2020) due to open data policies and their value being recognized across domains for urban greenspace planning and adaptation (Hansen et al., 2019; Monteiro et al., 2020). Berlin, as one of the greenest cities in Europe, provides an openly accessible tree inventory, with spatio-temporal environmental data sets relevant to tree growth. It features nearly 700000 individuals covering 94 genera and some 600 species and/or cultivars, listing information on location, stem diameter (at breast height; \(DBH\)), and stem height, among other variables, for the majority of street and park trees. For this study, our objective was to assess the impact of excess urban heat, i.e. the UHI effect, on tree growth (\(DBH\)) using this openly available inventory data set, complemented by additional open data sources as well as incremental growth data from tree cores. The assessment relied on flexible statistical models that could capture species and location-specific responses to heat and other urban factors. Specifically, we aimed to (1) assess heat exposure of the most abundant species; (2) determine the impact of (excess) heat on stem growth across tree age classes with a space-for-time substitution; (3) highlight the role of location-specific environmental factors in mediating temperature responses.
Berlin is one of the largest metropolitan areas in Central Europe (892\(~km^2\)) with a population of approximately 3.6 million, and a maximum extent of 38\(~km\) in North-South and 45\(~km\) in East-West directions. It is located in North-Eastern Germany, and lies in the temperate zone with warm-humid climate (Dfb) according to the updated Köppen-Geiger classification (Beck et al., 2018), with mean annual temperature of approximately 10\(^\circ C\) and precipitation of 575\(~mm\) (Tempelhof weather station, DWD). Berlin features low relief (approximately 30\(~m\) to 60\(~m\) with 120\(~m\) at solitary peaks), and is centered around a glacial outwash valley (sands, gravel), bordered by two plateaus consisting of glacial till and clay in the North-East and South, as well as sands in the South-West. The city provides extensive public green space covering around 30\(~\%\) of its area (SUVK, Berlin, 2019), with an extensive urban forest of nearly 700000 publicly-managed trees along streets, in parks and in riparian areas.
Fig. @ref(fig:fig-study_area)
An overview of data used for models, including sources, types, and application, is provided in Table\(~\)2, with detailed descriptions in the following subsections.
Table Table\(~\)2
Berlin’s open data provided tree inventories including species, age, location, and circumference which was transformed into diameter.
Note that only street trees in urban, not rural areas or within green spaces, were considered here, but individual trees may grow along streets adjacent to green spaces and parks of varying sizes.
Implausible observations, possibly from erroneous data entry, were removed.
Additional manual data processing for quality control was done with a bespoke software datacleanr by (Hurley et al., 2022), where obvious outliers or clearly interpolated data were removed.
The latter was deemed necessary, as several observations in multiple city districts were derived by linear relationships (i.e., straight-line), which do not capture the ontogenic growth dynamics of trees, and leave no variation related to variables other than age.
All of these operations were recorded, and can be viewed and reproduced via the supplementary code.
Lastly, observations with unlikely diameter-age combinations were identified via the residuals of a generalized linear model between diameter and age with a Gamma log-link distribution: if individual residuals exceeded seven times the median absolute deviation of all residuals, they were removed.
The median absolute deviation (MAD) is comparable to the inter-quartile range, yet more robust to outliers:
\[\begin{equation} MAD = median(\|X_i - median(X)\|) \tag{1} \end{equation}\]
This approach is considered conservative (see supplementary information), yet all analyses were carried forward with the unfiltered and filtered data - no considerable differences were found, thus subsequent sections are based on the filtered data. Table\(~\)3 shows the binned distribution of genera across age classes. Final samples applied in models were smaller, following the availability of ancillary data for a given observation, and limited to a maximum age of 125 years to increase confidence in reported values, and ultimately model estimates.
Table 3
Temperature and UHI data were summarized temporally either by the provider or manually to provide a characteristic representation of heat loading during summer at different times (morning, afternoon/day, night), from which tree averages (radius of 150\(~m\)) were calculated (Figure\(~\)7). Two urban air temperature and one surface UHI data set were tested as explanatory variables in generalized additive models (see Section GAMs). The air temperatures from the Berlin environmental atlas (EnvAt) are processed model outputs (FITNAH-3D; cf. Gross, 1994) that are representations of typical summer conditions at 0400, 1400 and 2200 hours; these data are provided at city block basis (spatial polygons), from which weighted averages were extracted after rasterizing (\(5~m\) resolution). UrbClim air temperatures are hourly model outputs (100\(~m\) resolution, De Ridder et al., 2015) based on ERA5 re-analyses data (ECMWF) for which observations from the hottest month available (June, 2011) were averaged to hours equivalent to Berlin Environmental Atlas (referred to as Berlin EnvAt) data by using a window of \(\pm~1~\)hour (i.e., 0300 to 0500, etc.). Subsequently, a land-use and land-cover mask (CORINE; European Union, Copernicus Land Monitoring Service 2018, European Environment Agency) was used to define urban and rural/forested areas. Using this mask was deemed reasonable as Berlin’s built-up area has not changed markedly over the past 50 years, i.e., about 52 to 61\(\%\) (Mohamed, 2017). The urban heat loading was then calculated as
\[\begin{equation} UHI_{x,y} = T_{Air_{2m}~x,y} - \overline{T_{Air_{2m}~Rural}}, \tag{2} \end{equation}\]
where \(T\) is temperature (\(^\circ C\)) \(x\) and \(y\) define an urban grid cell. The LandSat-derived surface UHI data set by Chakraborty and Lee (2019) (referred to as LandSat) estimates its measure in a similar fashion and the reader is referred to the detailed description therein; note this data set provides day and night-time averaged UHI estimates at 500\(~m\) resolution, which were extracted for the hottest summer in this record (2007).
**Fig. 7
Following the general approach described above, four ancillary covariates next to a temperature measure were employed in models; these were chosen due to their availability at high spatial resolution and coverage, and/or because their influence on growth was previously identified in literature or their likely impact could be deduced using ecophysiological principles. We included planting bed area and the sum of exchangeable basic cation as a proxy for soil nutrient availability (point extractions), as well as the proportional coverage of local climate zone 6 (LCZ6; open mid-rise urban land cover, see Demuzere et al. (2019) and Stewart and Oke (2012) for details) and adjacent building height (rasterized to \(5~m\) resolution). The latter was chosen as an increase reflects a transition away from densely urbanized areas and had the highest coverage (i.e., within [0-1]) for the processed tree inventory.
We modeled the stem diameter (\(DBH\)) of Berlin’s ten most abundant species (contingent to ancillary data availability) in relationship to their location, age, a measure of (excess) heat (UrbClim by De Ridder et al., 2015; Berlin Environmental Atlas models; LandSat-derived surface urban heat island by Chakraborty and Lee, 2019), and additional environmental covariates with generalized additive models (GAMs, see Section GAMs for details). Covariates were extracted at 150 and 300\(~m\) to infer the impact of reference scale of the urban fabric on tree growth. From all tested models the most suitable (i.e., parsimonious with highest explanatory power) was employed for further analyses.
To contextualize tree growth patterns between age groups derived from Berlin’s inventory data, we drew upon a recently established data set from (Schneider et al., 2021), who sampled several common tree species across a rural-urban gradient. For our purposes, we grouped trees sampled in parks, green spaces and along streets into a single urban category, and focused analyses on these. Two to three cores were extracted at breast height from each tree. These were then prepared using standard dendro-ecological methods (i.e., mounting, sanding, measuring), and cross-dated with TSAP-Win and COFECHA (Holmes et al., 1986), producing mean tree series of incremental growth. Additionally, cambial age of each increment was established by counting years from the inner most ring at the pith (\(a = 0\)) outward; on tangentially bored cores, missing rings to the pith were estimated.
Table 4
We applied hierarchical generalized additive models (GAM) to estimate the relationship of several covariates with stem diamater growth (\(DBH\)). GAMs, as an extension of generalized linear models (Wood, 2017), allow modeling response variables as parametric and non-parametric combinations of smoothed explanatory covariates, and can assume non-normal response distributions. These smooths are constructed by summation of base functions of varying complexity and form, analogous to scatterplot smoothing (Hastie and Tibshirani, 2017), which provides a high degree of flexibility, ideal for fitting ecosystem dynamics which are rarely linear (Pedersen et al., 2019), or correctly represented with deterministic functional forms (e.g. quadratic equations). In general, a GAM can be written as:
\[\begin{equation} E (Y)~=~g^{-1}\left( \beta_0 + \sum_{i = 1}^{n} f_i (x_i) \right), \tag{3} \end{equation}\]
and
\[\begin{equation} y~=~E (Y) + \epsilon, \tag{4} \end{equation}\]
where \(Y\) is taken from an appropriate distribution and corresponding link function \(g\), \(\beta_0\) is the intercept and \(f_i\) represents a smooth function of a predictor (Pedersen et al., 2019), and \(\epsilon \sim \mathcal{N}(0, \sigma ^2)\).
Nested data structures (e.g., city districts) can be accounted for by introducing random effects, while spatial dependence between observations can be accounted for by constructing smoothing functions with, for instance, northings and eastings (cf. Wood, 2017).
All models were implemented in R (R Core Team, 2021) using functions available in the package mgcv (Wood, 2017).
We assessed trends in annual growth dynamics of urban trees across the 20\(^{th}\) century for 1920-1960 and 1961-2001 (Dahlhausen et al., 2018; similar grouping as Pretzsch et al., 2017) with a hierarchical GAM implemented in mgcv::gamm() to leverage auto-correlation structures made available through the package nlme (Pinheiro et al., 2021).
Annual growth was modeled as:
\[\begin{equation} g(\Delta r_i) = f(year_i) + f_j(cambial~age_i) + c_{i,j} + \tau_{i,k} + e_i \tag{5} \end{equation}\]
where \(g()\) is a log-link for \(\Delta r \sim Gamma\), \(\Delta r\) is the annual radial increment for observation \(i\).
A global temporal (by year) and time-dependent (\(j\), \(\leq 1960\) or \(>1960\)) trend in cambial age were implemented with thin plate regression splines (default smoothing function in mgcv); \(c_j\) is a time-group dependent intercept, while \(tau\) represents a matrix of random effect coefficients for \(k\) species to account for differences in growth patterns, and \(e_i = \phi e_{i−1} + \epsilon_i\).
A \(3^{rd}\)-order autocorrelation-moving average (ARMA) correlation structure was applied (i.e., \(\phi(3,1)\)) to account for the dependency of \(\Delta r\) across years for each tree, as is frequently the case for tree growth (e.g., see Fritts and Swetnam, 1989); the detailed implementation for this model is given in the supplemental material code.
\(\Delta r\) was then derived for a range of cambial ages, and averaged for both time groups, allowing a comparison of recent to earlier growth.
We acknowledge that tree cores obtained at breast height do not represent absolute tree age.
However, here they serve as a proxy for growth between young (\(\gtrapprox 1960\)) and older individuals to contextualize growth patterns inferred from the larger-scale tree inventory.
We focused the analyses on the ten most abundant species after data quality control with a minimum of 10000 observations to ensure greatest possible spatial coverage and to increase confidence in estimates, totalling 218329 trees.
| Species | n |
|---|---|
| Tilia cordata | 58803 |
| Acer platanoides | 41090 |
| Platanus acerifolia | 22358 |
| Quercus robur | 20152 |
| Tilia platyphyllos | 16992 |
| Aesculus hippocastanum | 13899 |
| Tilia intermedia | 12926 |
| Tilia intermedia ‘Pallida’ | 11016 |
| Tilia euchlora | 10785 |
| Acer pseudoplatanus | 10308 |
The diameter (\(DBH\)) of these species was modeled using GAMs as follows:
\[\begin{equation} g(DBH_i) = f(x_i,y_i) + f_j(age_i) + f_j(temp_{t,i}, age_i) + f(covariate_i) + c_{i,j} + \tau_{i,k} + \epsilon_i \tag{6} \end{equation}\]
\[\begin{equation} g(DBH_i) = f(x_i,y_i) + f_j(age_i) + f_j(temp_{t,i}, age_i) + c_{i,j} + \tau_{i,k} + \epsilon_i \tag{7} \end{equation}\]
\[\begin{equation} g(DBH_i) = f(x_i,y_i) + f_j(age_i) + f(covariate_i) + c_{i,j} + \tau_{i,k} + \epsilon_i \tag{8} \end{equation}\]
\[\begin{equation} g(DBH_i) = f(x_i,y_i) + f_j(age_i) + c_{i,j} + \tau_{i,k} + \epsilon_i \tag{9} \end{equation}\]
where \(g()\) is a log-link for \(DBH \sim Gamma\), and \(i\), \(j\) are indices for observations and species, respectively, and \(t\) refers to an (excess) heat measure from UrbClim, Berlin EnvAt or LandSat at different times (morning, afternoon/day, night; see Section Temperature/UHI data; \(c\) is a species-dependent intercept, while \(tau\) represents a matrix of random effect coefficients for \(k\) districts to account for differing management regimes across the city. A global spatial smooth \(f(x_i,y_i)\) (representing projected coordinates in UTM) via a Gaussian process (cf. p. 242 in Wood, 2017) was included to account for the spatial structure of observations, which reduced auto-correlation of model residuals considerably (see supplemental information). These were compared to a sub-set of models without a spatial component (cf. Equation\(~\)(7), and Figure\(~\)1 but not further discussed there). We also tested a suite of models without the spatial smooth for comparison.
Furthermore, we implemented the interaction between temperature and age (i.e., \(f_j(temp, age)\)) as tensor smooths (Wood, 2006) to account for the different variable scales (i.e., units); all models were also tested without this interaction using a thin plate regression spline smooth for temperature (not shown in equations above).
The functions \(f_j\) are for species-specific smooths (i.e., with individual smoothness penalties and functional shapes as detailed by Pedersen et al., 2019).
The covariates for planting bed area and soil nutrient availability were log transformed to account for their skewed distribution, improving the estimation of coefficients for their respective basis functions.
Note, that Equation\(~\)(9) was considered as the appropriate null model for interpretations, and a model including all covariates and the temperature-age interaction was tested as a saturated model.
Models were implemented with mgcv::bam() (Li and Wood, 2020; Wood et al., 2017) and readers are referred to the detailed implementation in the supplemental material code.
Considering all combinations of (excess) heat measures and covariates (with point, as well as 150/300\(~m\) extractions), a total of 158 models were applied.
With these models, we derived age and species dependent \(DBH\) averages across a temperature measure from predicted values in 5-year age groups starting at 30, 45, 60, 75, 90
Model selection was based on explained deviance (similar to explained variance) and the Aikake’s information criteria (AIC). The best model, considered having the highest explanatory power (highest explained deviance) and simplest structure (lowest AIC), was chosen used for final analyses. Explained deviance assesses model fit in generalized linear modelling similar to explained variance in ordinary least squares modelling, but relies on a generalized form of residual, i.e., deviance residuals (see Wood, 2017). Deviance for the Gamma distribution is calculated as
\[\begin{equation} D(y, \hat{\mu}) = 2 \cdot \left\{ \frac{y−\hat{\mu}}{\hat{\mu}} − log(\frac{y}{\hat{\mu}}) \right\} \tag{10} \end{equation}\]
where \(y\) is the observed and \(\hat{\mu}\) the expected value.
As the spatial extent and coverage varied between temperature and ancillary data, more complex models (and specifically those including planting bed area and LandSat temperatures) typically also had fewer total observations. While this prevented a full comparison with information-based model selection criteria like AIC, the appropriateness of models that differed only in their implementation of the temperature-age interaction (i.e., \(f_j(temp_i)\) \(vs.\) \(f_j(temp_i, age_i)\)) could be assessed. For this reason, the suite of developed models presented above were limited to comparatively simple structures (i.e., few terms, interactions and restricted number of basis functions), reducing the potential for choosing over-fitted models without formal comparison. AIC is calculated as:
\[\begin{equation} AIC= −2log(L) \cdot 2k \tag{11} \end{equation}\]
where \(L\) is the maximum likelihood estimate, and \(k\) is the number of parameters. Where any two comparable models had a \(| \Delta AIC| > 10\), we considered the model with the lower score more suitable. We chose to carry out the analysis in its current form rather than on considerably smaller but comparable sample sizes across models, to identify the strongest relationships in the existing data. This allowed us to highlight the utility of the approach per se and for Berlin in particular. Lastly, we also report the root mean squared error (\(RMSE = \sqrt{\frac{1}{n} \sum_{i=1}^{n}{(y_i - \hat{y}_i)^2}}\)) and mean absolute error (\(MAE = \frac{1}{n}|y_i - \hat{y}_i|\)) for the final model.
Recently established annual, incremental growth is on average greater as compared to earlier times (i.e., prior 1960; Figure\(~\)8). The contrast is strongest in the first 30 years of cambial development, with clear indication that averages are statistically different up to approximately 22 years (cf. overlap of 95\(~\%\)confidence intervals). The predicted trajectories for both periods follow typical ontogenetic patterns, yet differ in shape. This may be due to inaccurately estimated cambial ages, explaining the largely monotonic decrease for recent growth due to missing the typical (but not always present) initial rise and fall in pith-near stages. This would inadvertently create a left-shift in Figure\(~\)8 for recent growth. Assuming that to be the case, the actual difference in average rates would be greater than presented here.
Figure 8
The null models containing only age to explain diameter captured a large portion of deviance. Including additional covariates thus resulted only in comparatively small increases of predictive skill. The inclusion of temperature (additive structure) and temperature-age interactions improved the predictive skill, with interactions reaching higher explained deviance (i.e., skill). Models including the LandSat-derived UHI measures performed best on average, where models containing \(LCZ6_{\frac{}{150~m}}\) or all covariates (saturated model) did best (Figure\(~\)1), which also holds true for other temperature/UHI measures. Note that non-temperature models containing planting bed area and \(LCZ6_{\frac{}{150~m}}\) alone outperformed several of those including other temperature measures and covariate combinations. This indicated the importance of both covariates, with \(LCZ6_{\frac{}{150~m}}\) also represented in the best-performing model, corroborating its impact on stem diameter development. As \(LCZ6_{\frac{}{300~m}}\) only marginally increased performance compared to the null model (age only), the reference scale for the impact of the urban fabric (i.e., LCZ6) likely has a limit at \(<300~m\). Nutrient availability, similar to \(LCZ6_{\frac{}{300~m}}\), also only improved model performance marginally. Building height, regardless of its reference scale (i.e., \(150\) \(vs\) \(300~m\)), as another proxy for urban fabric, showed minimal to no improvement compared to the age-only null model.
Differences in sample sizes between model structures due to covariate and/or temperature data coverage were considerable (e.g., planting bed area \(vs.\) soil nutrient models for UrbClim UHI). Thus, we considered the model with the overall greatest predictive skill and largest sample for further investigations, namely \(LCZ6_{\frac{}{150~m}}\) for LandSat derived UHI. For this variable combination, the model structure with age and temperature interaction also had a lower AIC (\(AIC~=~8.95 \cdot 10^5\)) compared to the additive one (\(AIC~=~8.98 \cdot 10^5\)).
Figure 1: Overview of model fit, expressed as explained deviance, for all tested models. The Panels distinguish between models without (top) and with (bottom) spatial smooths to account for autocorrelated residuals. Colored areas reflect (inclusion of) temperature and (excess) heat measures. Symbols identify models with an interaction between temperature and age (diamond), and without (circle), while symbol colors highlight covariates and their reference radius (i.e. 150/300\(~m\), but also note x-axis labels). Symbol size indicates the model’s included observations (i.e., sample size). Generally, including temperature (interactions) improve model fits above the null model (age only) and other non-temperature models. However, note the importance ofplanting bed area and LCZ6 cover is apparent even without including temperature. The LandSat-derived UHI measures provide the best fit on average, with the model including a temperature-age interaction and LCZ6 cover scoring highest overall.
The final model, comprising a temperature-age interaction, the covariate \(LCZ6_{\frac{}{150~m}}\) and LandSat derived UHI following Equation\(~\)(6), showed good agreement between predicted and observed \(DBH\) for the majority of observations (Figure\(~\)2; \(R^2~=~0.79\)) resulting in a fairly low MAE of \(6.03~cm\), and a RMSE of \(7.7\cdot 10^{-7}\). While some prediction errors are large (spread in Figure\(~\)2), the predictions are unbiased (see correspondence of 1:1 and least-squares fit in Figure\(~\)2). Thus, we consider using species and age-group averages as appropriate for identifying and discussing relationships between \(DBH\) and other covariates.
Figure 2: Predicted \(vs.\) observed data for the best model according to explained deviance and AIC comparison, based on the LandSat-derived UHI measure (Chakraborty and Lee, 2019) for average summer conditions (2007) and LCZ6 (open to mid-rise, Stewart and Oke, 2012). Hexagons and colors represent x-y bins (i.e., in the observed-predicted space) and their counts (N), and the red line is a least-squares fit. The model captures the mean response well, as indicated by lighter colors along the dashed 1:1 line, and its close approximation by the least squares fit, as well as model metrics (see text label in figure).
Relationships derived via the GAM in Equation\(~\)(6) with LCZ6 as covariate show fairly consistent responses across age groups considering trajectories, yet they differ somewhat in shape (Figure\(~\)3). Generally, younger age groups (30-35) showed increased growth, or little to no impact (Acer platanoides, Acer pseudoplatanus), with stronger UHI magnitude. At older ages, this trajectory gradually shifted toward decreasing stem diameter at higher temperatures. Notably, Platanus acerifolia showed an earlier reversal of it at age group 45-50 as compared to other species. This may be due to the stronger upward curvature towards the estimates’ lower temperature limit (see respective panel in Figure\(~\)3). Although the uncertainty around this edge is comparatively well-constrained, the low number of observations at the end of the continuum does call for caution for further interpretation. The distribution of individual street trees along the UHI continuum, evident in the density estimates in Figure\(~\)3, show that the majority of species investigated here are subjected to intermediate to high UHI levels. Trees in younger age groups are typically further toward the upper end of this continuum. Several species - including A. platanoides, A. pseudoplatanus, Acer hippocastanum, Quercus robur, and Tilia euchlora - have good coverage across the entire UHI continuum (although skewed toward positive values), which increases the confidence in the observed trajectory shift from young to old. Contrastingly, the cultivar Tilia intermedia ‘Palllida’ is predominantly distributed along the upper end of the continuum, resulting in greater uncertainty in the estimated relationship across age groups and UHI magnitudes.
Figure 3: Age-group averaged relationships between the LandSat-derived UHI and stem diameter for the ten most abundant species from the best model (i.e., Equation\(~\)(6) with LCZ6 as covariate). Note that estimates were derived with \(LCZ6_{\frac{}{150~m}} = 0.5\), and that y-axis scales differ between panel rows. The colored lines and bands are the averaged GAM fit, and gray lines are least-squares estimates fitted to these averages. Estimates were only generated within temperature ranges that a given species experienced. Density plots (scaled to match 100\(~cm\) on the y-axis) show the distribution of individual trees along the urban heat continuum, aiding both with interpreting uncertainties (band width, “wiggliness”) and assessing species’ exposure to urban heat. A general shift from increased or stable growth with temperature at younger ages to a decrease for older trees is apparent.
Figure 4: Temperature sensitivity of diameter growth centered on species-specific means for comparability and to aid interpretation of inter-specific differences. Lines and bands are from ordinary least squares fits on the GAM estimates in Fig.\(~\)3 (A) and the corresponding coefficients (slopes, B).
The chosen model also highlighted that, on average, trees growing in less urbanized environments (i.e., more open to mid-rise urban land cover; \(LCZ6_{\frac{}{150~m}}\)) achieve somewhat greater diameters. Note, that planting bed area, similar to \(LCZ6_{\frac{}{150~m}}\), was identified as a covariate increasing explained deviance, both in models with and without a temperature measure, and represents an opportunity for further investigation, especially following additional data collection.
Figure 5: Age-group averaged relationships between the LCZ6 and stem diameter for two \(Tilia\) genera from the best model (i.e., Equation\(~\)(6) with LandSat-derived UHI as excess heat measure). LCZ6 cover was derived as a weighted average within a radius of 150\(~m\) around each tree. Note the identical shape, but difference in intercept as a result of the model specification with a global smooth the non-temperature covariate for all species (see Section Stem diameter model development and selection; the UHI magnitude was fixed at \(3^\circ C\) where most trees are present across age groups. The model identified a steady increase of average stem diameter with open to mid-rise urban land cover.
We showed that initial growth rates (\(\lessapprox 30~a\)) are greater in recently established trees, as compared to older ones. This pattern was found in annually-resolved incremental growth observations, and corresponds with prior work on urban-rural (e.g., Pretzsch et al., 2017; Zhao et al., 2016) and intra-urban comparisons (e.g., Dahlhausen et al., 2018), where it was attributed to increased temperatures - namely, the UHI effect. Furthermore, the space-for-time substitution highlighted that recently established, i.e. younger, trees have greater (positive) sensitivity to heat than intermediate age groups, while the oldest trees showed lower average absolute growth (diameters) with increasing heat. We are not aware of other studies reporting such an age-dependent shift in sensitivity. However, the consistency of this pattern across species provides a satisfactory level of confidence in it, and indeed, Dahlhausen et al. (2018) also report a time - and by implication an age-dependent - shift of temperature sensitivity on growth depending on urban development intensity, but do not further distinguish between age groups. The age-dependent shift may be explained by a combination of physiological and environmental changes over time, and/or may be related to management practices, but we acknowledge that additional analyses is required to disentangle these processes. For instance, higher temperatures may have benefited physiological processes close to their respective optima in both young and old individuals. But, as older trees must support greater leaf area through rooting in similarly small (restricted) soil volumes, water availability and nutrients may become increasingly limiting. Higher rates of transpiration and photosynthesis, driven through temperature increases, could thus turn detrimental as the ratios between soil volume, root and leaf area change throughout a trees life span and in time. In addition to this, younger trees are typically more rigorously managed and irrigated (e.g., Koeser et al., 2014), which may (further) alter age-dependent temperature sensitivity. We lack data on demographic differences in management to confirm this, and acknowledge that younger trees here also surpassed their establishment phase, which entails high mortality during the first five years after planting (Harris and Day, 2017; Hilbert et al., 2019) during which the above would not apply
Generally, the ten investigated species and cultivars showed consistent responses in sign, but differed substantially in magnitude, modulated by age. Aesculum hippocastanum , Tilia intermedia, T. intermedia ‘pallida’ and showed greatest absolute growth at high temperatures at younger (30-35) as well as older ages (60-65). The drought tolerance and performance under high temperatures is well documented for the latter two and features in tree selection frameworks, such as species climate matrices in Roloff et al. (2009) and in Brune (2016), where several studies investigating (hydro)climatic sensitivity were summarized. Contrastingly, Acer pseudoplatanus showed decreased absolute growth with higher temperatures; this corresponds with their classification as not very suitable under (Roloff et al., 2009). Platanus acerifolia and Acer platanoides are considered suitable and very suitable under Roloff et al. (2009), yet show the greatest decline with temperatures here. Indeed, several studies reviewed in Brune (2016) highlight A. platanoides as drought-tolerant. Such contrasting indications are also reported for A. pseudoplatanus, and even report A. hippocastanum as only moderately tolerant, despite performing best here. These differences may be due to idiosyncrasies of planting sites, such as low soil volume and increased potential for water stress, (long-term) interactions with pathogens and herbivores (e.g., Meineke and Frank, 2018), or perhaps related to data quality. Most important, these differences highlight the need for assessments of performance (e.g., as growth) under location-specific conditions, and this study provides a first attempt and insights into temperature sensitivity at the scale of an entire metropolis.
The major strength of this work is the estimation of growth sensitivity of urban trees to heat and other urban conditions at city-scale with little data and processing power requirements.
In particular, differences within and between species across several dimensions (depending on model specification) are readily explored where only tree locations, age, diameter (or any other target) and selected variables are needed to implement this approach.
Indeed, due to the high temporal and spatial variability of controls on growth, and thus variability in responses, such assessments should be developed specifically for a given setting, as well-understood tree characteristics (e.g., Roloff et al., 2009), could be strongly mediated at a given location or time. For instance, if drought hardiness is related to extensive root networks, the limited soil volumes available to street trees could render a species vulnerable to water stress. In addition, studies on climatic suitability and physiological responses are difficult to compare due to local acclimation of species to controls on growth and performance, as well as different methodological protocols (e.g., see Brune, 2016).
Thus, the approach followed here, with its comparatively simple data requirements, can be implemented in a variety of settings to gain initial and/or guiding insights on tree performance in a specific urban region, and also serve as a benchmark to compare other empirical estimates or mechanistic models with.
It may even allow estimating tree performance under future conditions to some extent (e.g., considering increase in temperature alone for this study), but is of course contingent on data coverage (e.g., demographics, environmental gradients).
Furthermore, the implementation of GAM models here with mgcv::bam() is computationally efficient (Wood et al., 2017), making model testing, selection and final application a rapid process even for abundant observations (here \(n >150000\) and processing times of \(t < 60~s\) on a desktop computer for the selected model).
Future research may focus on collecting additional data (e.g., increasing the coverage on planting bed area) and subsequently deriving species-specific smooths for ancillary environmental covariates.
Two noteworthy caveats of the approach followed here lie in its empirical, data-driven nature, and its dependence on data quality. The presented results here are correlative. While their interpretation follows first principles and may also allow anticipating the performance of a given tree under a range of increased temperatures, insights derived from mechanistic understanding are indispensable. This is because a change in one environmental condition, e.g., temperature or pest abundance, may also entail alterations to other drivers, and ultimately, affect physiological processes in ways not yet captured in existing data. Furthermore, the data provided in Berlin’s tree inventory required extensive quality controlling, and it is likely that inaccurate (diameter, age) or interpolated (diameter) observations were including in the analyses. However, the minimum requirement of 10000 observations per species, and the focus on average diameter responses across age groups decreased potential impacts of such observations may have had to a degree that was considered satisfactory, while uncertainty is communicated effectively (i.e., larger confidence bands for T. intermedia ‘Pallida’). Further, the utility and transferability of the approach per se were considered of value to the community and practitioners.
Tree species selection for urban forests and urban greening depends on several often competing factors - such as aesthetic appeal and ecosystem service provision (e.g., Roman et al., 2020) - and has been focused on identifying trees suitable for current and future conditions for the last decades with increasingly strong evidence basis (e.g., Hirons and Sjöman, 2019). Here, biogeographical approaches aim to match natural (i.e., non-urban) species distributions within current bioclimatic envelopes with future conditions at a planting site (e.g., Broadmeadow et al., 2005). Watkins et al. (2020) used such “climate matching” to identify the space along environmental dimensions (heat, precipitation) which urban regions inhabit, falling within or extending the natural envelope. Our work extends this approach by identifying the heat space (i.e., UHI range) occupied by several key species within a city. This not only allows identifying species-specific and intra-specific (age-dependent) sensitivities to urban conditions, but can also inform how age cohorts may respond as temperatures at a given location shift along the present continuum in the future, while accounting for the impacts of additional urban conditions (e.g., management, disturbance, urban fabric). Notably, increased growth rates at higher temperatures, as reported here, also entail greater demographic turnover through mortality (Smith et al., 2019). Thus, as the majority of individuals were situated at the upper end of the temperature continuum, high temperature sensitivity may entail additional management due to faster turn-over and potentially subsequent replacement by more appropriate species. The insights derived from this study, and potentially others following this approach in the future, can therefore add another dimension to tree selection approaches implemented down to individual trees and locations. It thus complements quantitative and qualitative tools, such as the climate matching tool, ecological site classification or species-climate matrix [see Hirons and Sjöman (2019); Roloff et al. (2009); Brune (2016); and references therein].
Urban tree databases may only represent a single point in time (e.g., see https://opentrees.org), despite repeated inventories at regular and/or irregular intervals within cities or their districts. However, storing and providing multi-temporal inventories, rather than over-writing existing records, would give planners and researchers a wealth of additional information on individual tree performance, which – even at low temporal coverage – would increase confidence in space-for-time substitutions, or make them superfluous. In this regard, it is recommended to enhance and standardize (meta) data collection and provisioning to facilitate data quality control, thereby increasing confidence in the data, and enabling synthesis studies across districts and cities (Ossola et al., 2020). Key information should include (1) location, absolute age and planting date, (2) management regimes (e.g., planting regime, crown pruning, irrigation), (3) data collection protocol and body (e.g., actual measurement \(vs.\) estimate or interpolation), as well as (4) tree provenance to account for potential differences in ecotype and/or acclimation (Watkins et al., 2020). In addition, managers and planners also should be aware of potential scale-limit of the urban fabric (e.g., \(LCZ6_{\frac{}{150~m}}~vs.~LCZ6_{\frac{}{300~m}}\)) on tree performance when altering surroundings or planting in a given area; efforts should be directed toward identifying the most pertinent environmental covariates and how their influence scales in space.
Finally, open data was key for this study. We expressedly urge authorities to implement FAIR (findable, accessible, inter-operable and reusable) data principles to enable work similar to ours, as well as the creation of larger-scale databases and/or execution of synoptic studies (e.g., Ossola et al., 2020).
We showcased the application of a GAM-driven analyses to identify the sensitivity of urban tree growth to environmental and urban drivers in a space-for-time framework. The extensive data coverage (tree inventory and covariates) allowed us to account for location-specific factors and identify the most influences on growth potential (i.e., maximum diameter) for individual species. The identified age-dependent shift of temperature sensitivity, which was fairly consistent across species, as well as inter-specific differences thereof, will inform and guide further research. Notably, increasing the confidence in and coverage of data collection on less abundant species is key to maximize the potential of the approach developed here. Our results are a contribution toward Berlin’s current and future management of its tree stock and may help drive adaptation to climate change. Despite being a case study for a single city, we believe our work may provide a flexible approach for other cities with available or growing inventories, as well as ancillary environmental data, and may also inform the use of other planning tools, such as species-climate matrices (Roloff et al., 2009) regarding temperature sensitivity.
AGH and IH were supported through the Helmholtz-Climate-Initiative (HI-CAM), funded by the Helmholtz Initiative and Networking Fund; the authors are responsible for the content of this publication.
Figure 6 Berlin’s generalized land-use derived from SUVK, Berlin (2019) and location within the European context (inset).
Figure 7 Overview of data sets used for assessing the relationship between temperature and tree growth, with data sources in rows and averaged time intervals in columns. Note the varying spatial coverage and resolution as noted in Table 2.
Figure 8 Rates for annual incremental growth differ between recent and earlier times, as predicted by a hierarchical GAM (see Section Dendrochronological analyses. Thick lines and bands are for mean and 95\(~\%\) confidence interval of all annual predictions across a time group (fine lines). On average, early stages of recent growth (up to approximately 22 years) exceed that of earlier periods discernibly.
Figure 6: PLACEHOLDER
Figure 7: PLACEHOLDER
Figure 8: PLACEHOLDER
| Name | Accessed | Type | Unit | Resolution | Radius | Source | Reference |
|---|---|---|---|---|---|---|---|
| Street Trees | Oct ’20 | Point | https://daten.berlin.de/ | ||||
| UHI Berlin | Dec ’19 | Raster | \(^\circ C\) | 500 | 150 | https://yceo.yale.edu/research/global-surface-uhi-explorer | Chakraborty et al. (2019) |
| UHI Berlin | Dec ’19 | Raster | \(^\circ C\) | 500 | 150 | https://yceo.yale.edu/research/global-surface-uhi-explorer | |
| Berlin Climate Model, Air temperature 2015 (Umweltatlas) | Feb ’21 | Polygon | \(^\circ C\) | 5 | 150 | https://daten.berlin.de/ | |
| UrbClim ERA5 Model Output (ECMWF, UCSC) | Mar ’21 | Raster | \(^\circ C\) | 100 | 150 | https://cds.climate.copernicus.eu/ | Deridder et al. (2015) |
| Berlin Land-use | Apr ’21 | Polygon | https://daten.berlin.de/ | ||||
| Copernicus CORINE CLC | Mar ’21 | Raster | 100 | https://land.copernicus.eu/ | |||
| WUDAPT LCZ | Oct ’20 | Raster | 100 | 150/300 | https://www.wudapt.org/continental-lcz-maps/ | Demuzere et al. (2019) | |
| Berlin Veg/Building Height | Oct ’20 | Polygon | \(m\) | 5 | 150/300 | https://daten.berlin.de/ | |
| Berlin Soil Nutrients, Bodenkundliche Kennwerte 2015 (Umweltatlas) | Nov ’20 | Polygon | \(mol~m^{-2}\) | 0 | https://daten.berlin.de/ | ||
| Planting Bed Area | Oct ’20 | Polygon | \(m^2\) | 0 | https://daten.berlin.de/ | ||
| Berlin Soils | Oct ’20 | Polygon | 0 | https://daten.berlin.de/ | |||
| Berlin Districts | Oct ’20 | Polygon | https://daten.berlin.de/ | ||||
| Berlin Transport Network | Feb ’21 | Polygon | OpenStreetMap Overpass API | ||||
| Berlin Water (Ways) | Feb ’21 | Polygon | OpenStreetMap Overpass API |
| Genera | (0,30] | (30,60] | (60,90] | (90,120] | (120,150] | 150+ | Total (n) | Missing (n) |
|---|---|---|---|---|---|---|---|---|
| Tilia | 40128 | 60854 | 34599 | 4390 | 120 | 11 | 140232 | 130 |
| Acer | 23306 | 33771 | 10220 | 1798 | 62 | 17 | 69330 | 156 |
| Quercus | 8686 | 16107 | 5721 | 2595 | 562 | 157 | 33873 | 45 |
| Platanus | 4467 | 11836 | 4784 | 1449 | 805 | 68 | 23425 | 16 |
| Aesculus | 4464 | 7064 | 5566 | 1211 | 91 | 25 | 18427 | 6 |
| Betula | 2469 | 7155 | 897 | 36 | 2 | 1 | 10572 | 12 |
| Fraxinus | 4324 | 3332 | 742 | 131 | 6 | 0 | 8543 | 8 |
| Robinia | 2494 | 4523 | 857 | 83 | 3 | 1 | 7975 | 14 |
| Carpinus | 3905 | 2349 | 176 | 4 | 0 | 0 | 6466 | 32 |
| Prunus | 3792 | 2121 | 111 | 12 | 0 | 0 | 6067 | 31 |
| Populus | 639 | 3559 | 991 | 279 | 17 | 14 | 5515 | 16 |
| Pinus | 422 | 1349 | 463 | 27 | 0 | 1 | 2269 | 7 |
| Other | 22337 | 12620 | 1799 | 448 | 61 | 17 | 37554 | 272 |
| Marg. Totals | 121433 | 166640 | 66926 | 12463 | 1729 | 312 | 370248 | 745 |
| Location | Species | n |
|---|---|---|
| Alpenrose | Quercus robur | 15 |
| Grünanlage Britz-Süd | Fagus sylvatica | 17 |
| Grünanlage Britz-Süd | Pseudotsuga menziesii | 17 |
| Grünanlage Britz-Süd | Fraxinus excelsior | 14 |
| Grünanlage Britz-Süd | Pinus sylvestris | 16 |
| Grünanlage Britz-Süd | Larix decidua | 16 |
| Grünanlage Britz-Süd | Tilia Cordata | 16 |
| Grünanlage Britz-Süd | Quercus robur | 15 |
| Grünanlage Britz-Süd | Quercus petraea | 21 |
| Hasenheide | Quercus robur | 12 |
| Hasenheide | Quercus robur | 14 |
| Spielplatz Weigandufer & Wildenbruchplatz | Fraxinus excelsior | 19 |
| Werrastraße | Fraxinus excelsior | 12 |